Overview
Dataset statistics
| Number of variables | 18 |
|---|---|
| Number of observations | 2751 |
| Missing cells | 15932 |
| Missing cells (%) | 32.2% |
| Duplicate rows | 5 |
| Duplicate rows (%) | 0.2% |
| Total size in memory | 884.4 KiB |
| Average record size in memory | 329.2 B |
Variable types
| Categorical | 8 |
|---|---|
| Numeric | 8 |
| DateTime | 1 |
| Boolean | 1 |
Dataset
| Description | Quality-verified clinical data for JHB_Aurum_009 |
|---|---|
| Creator | HEAT Research Programme |
| Author | RP2 Clinical Data Team |
| URL | https://github.com/Logic06183/RP2_dataoverview |
Variable descriptions
| study_source | Study identifier |
|---|---|
| Age (at enrolment) | Patient age at study enrollment |
| Sex | Biological sex |
| Race | Racial/ethnic group |
| enrollment_date | Date of study enrollment |
| visit_date | Date of clinic visit |
| primary_date | Primary reference date |
| study_arm | Study treatment arm |
| study_visit | Study visit number |
| Antiretroviral Therapy Status | Current ART status |
| BMI (kg/m²) | Body Mass Index |
| weight_kg | Body weight in kilograms |
| height_m | Height in meters |
| Waist circumference (cm) | Waist circumference in centimeters |
| hip_circumference_cm | Hip circumference in centimeters |
| waist_hip_ratio | Waist-to-hip ratio |
| systolic_bp_mmHg | Systolic blood pressure |
| diastolic_bp_mmHg | Diastolic blood pressure |
| heart_rate_bpm | Heart rate in beats per minute |
| Respiratory rate (breaths/min) | Respiratory rate |
| Oxygen saturation (%) | Oxygen saturation |
| body_temperature_celsius | Body temperature in Celsius |
| CD4 cell count (cells/µL) | CD4+ T lymphocyte count |
| HIV viral load (copies/mL) | HIV RNA copies per mL |
| cd4_percent | CD4+ percentage |
| cd8_count_cells_uL | CD8+ T lymphocyte count |
| cd4_cd8_ratio | CD4/CD8 ratio |
| Hematocrit (%) | Hematocrit |
| hemoglobin_g_dL | Hemoglobin concentration |
| White blood cell count (×10³/µL) | Total WBC count |
| Red blood cell count (×10⁶/µL) | Total RBC count |
| Platelet count (×10³/µL) | Platelet count |
| MCV (MEAN CELL VOLUME) | Mean corpuscular volume |
| mch_pg | Mean corpuscular hemoglobin |
| mchc_g_dL | Mean corpuscular hemoglobin concentration |
| RDW | Red cell distribution width |
| Lymphocyte count (×10⁹/L) | Lymphocyte absolute count |
| Neutrophil count (×10⁹/L) | Neutrophil absolute count |
| Monocyte count (×10⁹/L) | Monocyte absolute count |
| Eosinophil count (×10⁹/L) | Eosinophil absolute count |
| Basophil count (×10⁹/L) | Basophil absolute count |
| lymphocyte_percent | Lymphocyte percentage |
| neutrophil_percent | Neutrophil percentage |
| monocyte_percent | Monocyte percentage |
| eosinophil_percent | Eosinophil percentage |
| basophil_percent | Basophil percentage |
| ALT (U/L) | Alanine aminotransferase |
| AST (U/L) | Aspartate aminotransferase |
| Alkaline phosphatase (U/L) | Alkaline phosphatase |
| Total bilirubin (mg/dL) | Total bilirubin |
| direct_bilirubin_mg_dL | Direct bilirubin |
| indirect_bilirubin_mg_dL | Indirect bilirubin |
| Albumin (g/dL) | Serum albumin |
| Total protein (g/dL) | Total serum protein |
| ggt_u_L | Gamma-glutamyl transferase |
| creatinine_umol_L | Serum creatinine (µmol/L) |
| creatinine_mg_dL | Serum creatinine (mg/dL) |
| creatinine clearance | Estimated creatinine clearance |
| bun_mg_dL | Blood urea nitrogen |
| urea_mmol_L | Serum urea |
| egfr_ml_min | Estimated glomerular filtration rate |
| Sodium (mEq/L) | Serum sodium |
| Potassium (mEq/L) | Serum potassium |
| chloride_mEq_L | Serum chloride |
| bicarbonate_mEq_L | Serum bicarbonate |
| calcium_mg_dL | Serum calcium |
| magnesium_mg_dL | Serum magnesium |
| phosphate_mg_dL | Serum phosphate |
| total_cholesterol_mg_dL | Total cholesterol |
| hdl_cholesterol_mg_dL | HDL cholesterol |
| ldl_cholesterol_mg_dL | LDL cholesterol |
| Triglycerides (mg/dL) | Triglycerides |
| vldl_cholesterol_mg_dL | VLDL cholesterol |
| cholesterol_hdl_ratio | Total cholesterol/HDL ratio |
| fasting_glucose_mmol_L | Fasting blood glucose (mmol/L) |
| glucose_mg_dL | Blood glucose (mg/dL) |
| hba1c_percent | Glycated hemoglobin |
| insulin_uIU_mL | Serum insulin |
| lactate_mmol_L | Blood lactate |
| crp_mg_L | C-reactive protein |
| esr_mm_hr | Erythrocyte sedimentation rate |
| pt_seconds | Prothrombin time |
| inr | International normalized ratio |
| aptt_seconds | Activated partial thromboplastin time |
| uric_acid_mg_dL | Serum uric acid |
| ldh_u_L | Lactate dehydrogenase |
| ck_u_L | Creatine kinase |
| amylase_u_L | Serum amylase |
| lipase_u_L | Serum lipase |
| climate_daily_mean_temp | Daily mean temperature |
| climate_daily_max_temp | Daily maximum temperature |
| climate_daily_min_temp | Daily minimum temperature |
| climate_temp_anomaly | Temperature anomaly from baseline |
| climate_heat_day_p90 | Heat day indicator (>90th percentile) |
| climate_heat_day_p95 | Heat day indicator (>95th percentile) |
| climate_heat_stress_index | Heat stress index |
| climate_humidity | Relative humidity |
| climate_precipitation | Precipitation |
| climate_season | Season |
| cd4_correction_applied | Quality flag: CD4 corrections applied |
| final_comprehensive_fix_applied | Quality flag: Comprehensive corrections applied |
| waist_circ_unit_correction_applied | Quality flag: Waist circumference unit corrected |
| sa_biomarker_standards | South African biomarker reference standards applied |
study_source has constant value "JHB_Aurum_009" | Constant |
final_comprehensive_fix_applied has constant value "1.0" | Constant |
waist_circ_unit_correction_applied has constant value "False" | Constant |
sa_biomarker_standards has constant value "1.0" | Constant |
Dataset has 5 (0.2%) duplicate rows | Duplicates
CD4 cell count (cells/µL) is highly overall correlated with cd4_correction_applied | High correlation |
cd4_correction_applied is highly overall correlated with CD4 cell count (cells/µL) | High correlation |
climate_daily_max_temp is highly overall correlated with climate_daily_mean_temp and 5 other fields | High correlation |
climate_daily_mean_temp is highly overall correlated with climate_daily_max_temp and 5 other fields | High correlation |
climate_daily_min_temp is highly overall correlated with climate_daily_max_temp and 5 other fields | High correlation |
climate_heat_day_p90 is highly overall correlated with climate_daily_max_temp and 6 other fields | High correlation |
climate_heat_day_p95 is highly overall correlated with climate_daily_max_temp and 6 other fields | High correlation |
climate_heat_stress_index is highly overall correlated with climate_daily_max_temp and 5 other fields | High correlation |
climate_season is highly overall correlated with climate_daily_max_temp and 6 other fields | High correlation |
climate_temp_anomaly is highly overall correlated with climate_heat_day_p90 and 2 other fields | High correlation |
cd4_correction_applied is highly imbalanced (85.9%) | Imbalance |
climate_heat_day_p90 is highly imbalanced (69.4%) | Imbalance |
climate_heat_day_p95 is highly imbalanced (69.4%) | Imbalance |
CD4 cell count (cells/µL) has 533 (19.4%) missing values | Missing |
HIV viral load (copies/mL) has 2461 (89.5%) missing values | Missing |
climate_daily_mean_temp has 1616 (58.7%) missing values | Missing |
climate_daily_max_temp has 1616 (58.7%) missing values | Missing |
climate_daily_min_temp has 1616 (58.7%) missing values | Missing |
climate_temp_anomaly has 1616 (58.7%) missing values | Missing |
climate_heat_day_p90 has 1616 (58.7%) missing values | Missing |
climate_heat_day_p95 has 1616 (58.7%) missing values | Missing |
climate_heat_stress_index has 1616 (58.7%) missing values | Missing |
climate_season has 1616 (58.7%) missing values | Missing |
HIV viral load (copies/mL) has 246 (8.9%) zeros | Zeros |
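
The imbalance percentages above are consistent with a normalised-entropy score, that is, one minus the Shannon entropy of the class frequencies divided by the maximum possible entropy for that number of classes. The sketch below assumes that definition (the report does not state it) and applies it to the class counts reported for cd4_correction_applied (2696 and 55) and for the heat-day indicators (1073 and 62). The function name `imbalance_score` is illustrative.

```python
import math

def imbalance_score(counts):
    """Return 1 - H / log2(k): H is the Shannon entropy in bits, k the number of classes."""
    total = sum(counts)
    k = len(counts)
    if k < 2:
        # A single class has no meaningful balance; returning 0.0 is a choice made for this sketch.
        return 0.0
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1 - entropy / math.log2(k)

cd4_flag_score = imbalance_score([2696, 55])   # cd4_correction_applied
heat_day_score = imbalance_score([1073, 62])   # climate_heat_day_p90 / p95, non-missing rows only

print(f"cd4_correction_applied: {cd4_flag_score:.1%}")
print(f"climate_heat_day_p90:   {heat_day_score:.1%}")
```

Under this assumed definition the two scores come out at about 85.9% and 69.4%, matching the alerts. A score near 100% means almost all rows fall in one class.
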
Reproduction
| Analysis started | 2025-11-25 05:10:07.797710 |
|---|---|
| Analysis finished | 2025-11-25 05:10:10.643284 |
| Duration | 2.85 seconds |
| Software version | ydata-profiling v4.18.0 |
| Download configuration | config.json |
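
The reported duration is the difference between the two timestamps. A minimal check using only the standard library:

```python
from datetime import datetime

# Timestamps copied from the "Reproduction" table.
started = datetime.fromisoformat("2025-11-25 05:10:07.797710")
finished = datetime.fromisoformat("2025-11-25 05:10:10.643284")

duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")
```
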
Variables
study_source
Categorical
Constant
Study identifier
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 188.1 KiB |
| JHB_Aurum_009 |
|---|
Length
| Max length | 13 |
|---|---|
| Median length | 13 |
| Mean length | 13 |
| Min length | 13 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | JHB_Aurum_009 |
|---|---|
| 2nd row | JHB_Aurum_009 |
| 3rd row | JHB_Aurum_009 |
| 4th row | JHB_Aurum_009 |
| 5th row | JHB_Aurum_009 |
Common Values
| Value | Count | Frequency (%) |
| JHB_Aurum_009 | 2751 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| jhb_aurum_009 | 2751 |
Most occurring characters
| Value | Count | Frequency (%) |
| _ | 5502 | |
| u | 5502 | |
| 0 | 5502 | |
| J | 2751 | |
| H | 2751 | |
| B | 2751 | |
| A | 2751 | |
| r | 2751 | |
| m | 2751 | |
| 9 | 2751 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 11004 | |
| Uppercase Letter | 11004 | |
| Decimal Number | 8253 | |
| Connector Punctuation | 5502 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| J | 2751 | |
| H | 2751 | |
| B | 2751 | |
| A | 2751 |
Lowercase Letter
| Value | Count | Frequency (%) |
| u | 5502 | |
| r | 2751 | |
| m | 2751 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 5502 | |
| 9 | 2751 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 5502 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 22008 | |
| Common | 13755 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| u | 5502 | |
| J | 2751 | |
| H | 2751 | |
| B | 2751 | |
| A | 2751 | |
| r | 2751 | |
| m | 2751 |
Common
| Value | Count | Frequency (%) |
| _ | 5502 | |
| 0 | 5502 | |
| 9 | 2751 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 35763 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| _ | 5502 | |
| u | 5502 | |
| 0 | 5502 | |
| J | 2751 | |
| H | 2751 | |
| B | 2751 | |
| A | 2751 | |
| r | 2751 | |
| m | 2751 | |
| 9 | 2751 |
Age (at enrolment)
Real number (ℝ)
Patient age at study enrollment
| Distinct | 59 |
|---|---|
| Distinct (%) | 2.1% |
| Missing | 6 |
| Missing (%) | 0.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 34.426958 |
| Minimum | 15 |
|---|---|
| Maximum | 76 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 43.0 KiB |
Quantile statistics
| Minimum | 15 |
|---|---|
| 5-th percentile | 20 |
| Q1 | 27 |
| median | 33 |
| Q3 | 40 |
| 95-th percentile | 54 |
| Maximum | 76 |
| Range | 61 |
| Interquartile range (IQR) | 13 |
Descriptive statistics
| Standard deviation | 10.178108 |
|---|---|
| Coefficient of variation (CV) | 0.29564354 |
| Kurtosis | 0.24473046 |
| Mean | 34.426958 |
| Median Absolute Deviation (MAD) | 7 |
| Skewness | 0.70885633 |
| Sum | 94502 |
| Variance | 103.59388 |
| Monotonicity | Not monotonic |
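
Several of the statistics above are derived from others in the same tables. The sketch below recomputes them for Age (at enrolment) from the reported values, as a way of reading the tables rather than as the profiler's own code; the variable names are illustrative.

```python
# Values copied from the Age (at enrolment) tables.
mean, std = 34.426958, 10.178108
minimum, q1, q3, maximum = 15, 27, 40, 76
total, n_rows, n_missing = 94502, 2751, 6

cv = std / mean                     # coefficient of variation
variance = std ** 2                 # variance is the squared standard deviation
iqr = q3 - q1                       # interquartile range
value_range = maximum - minimum
n_valid = n_rows - n_missing        # rows with a recorded age
mean_from_sum = total / n_valid     # should reproduce the reported mean

print(f"CV={cv:.4f}  variance={variance:.3f}  IQR={iqr}  range={value_range}")
print(f"valid rows={n_valid}  mean from sum={mean_from_sum:.4f}")
```

The mean is taken over the 2745 non-missing rows, not all 2751 observations.
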
| Value | Count | Frequency (%) |
| 31 | 125 | 4.5% |
| 30 | 117 | 4.3% |
| 29 | 116 | 4.2% |
| 28 | 113 | 4.1% |
| 27 | 108 | 3.9% |
| 32 | 106 | 3.9% |
| 26 | 104 | 3.8% |
| 34 | 102 | 3.7% |
| 24 | 101 | 3.7% |
| 33 | 97 | 3.5% |
| Other values (49) | 1656 |
| Value | Count | Frequency (%) |
| 15 | 4 | 0.1% |
| 16 | 3 | 0.1% |
| 17 | 15 | 0.5% |
| 18 | 24 | 0.9% |
| 19 | 40 | 1.5% |
| 20 | 59 | |
| 21 | 56 | |
| 22 | 73 | |
| 23 | 85 | |
| 24 | 101 |
| Value | Count | Frequency (%) |
| 76 | 1 | < 0.1% |
| 74 | 1 | < 0.1% |
| 72 | 2 | 0.1% |
| 71 | 1 | < 0.1% |
| 70 | 1 | < 0.1% |
| 69 | 2 | 0.1% |
| 68 | 3 | |
| 67 | 1 | < 0.1% |
| 66 | 1 | < 0.1% |
| 65 | 5 |
Sex
Categorical
Biological sex
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 4 |
| Missing (%) | 0.1% |
| Memory size | 165.9 KiB |
| Male | |
|---|---|
| Female |
Length
| Max length | 6 |
|---|---|
| Median length | 4 |
| Mean length | 4.7564616 |
| Min length | 4 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Female |
|---|---|
| 2nd row | Female |
| 3rd row | Male |
| 4th row | Male |
| 5th row | Female |
Common Values
| Value | Count | Frequency (%) |
| Male | 1708 | |
| Female | 1039 | |
| (Missing) | 4 | 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| male | 1708 | |
| female | 1039 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 3786 | |
| a | 2747 | |
| l | 2747 | |
| M | 1708 | |
| F | 1039 | 8.0% |
| m | 1039 | 8.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 10319 | |
| Uppercase Letter | 2747 | 21.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 3786 | |
| a | 2747 | |
| l | 2747 | |
| m | 1039 | 10.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 1708 | |
| F | 1039 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 13066 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 3786 | |
| a | 2747 | |
| l | 2747 | |
| M | 1708 | |
| F | 1039 | 8.0% |
| m | 1039 | 8.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 13066 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 3786 | |
| a | 2747 | |
| l | 2747 | |
| M | 1708 | |
| F | 1039 | 8.0% |
| m | 1039 | 8.0% |
primary_date
Date
Primary reference date
| Distinct | 447 |
|---|---|
| Distinct (%) | 16.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 43.0 KiB |
| Minimum | 2013-03-14 00:00:00 |
|---|---|
| Maximum | 2015-08-01 00:00:00 |
| Invalid dates | 0 |
| Invalid dates (%) | 0.0% |
CD4 cell count (cells/µL)
Real number (ℝ)
High correlation Missing
CD4+ T lymphocyte count
| Distinct | 854 |
|---|---|
| Distinct (%) | 38.5% |
| Missing | 533 |
| Missing (%) | 19.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 456.95807 |
| Minimum | 3 |
|---|---|
| Maximum | 2703 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 43.0 KiB |
Quantile statistics
| Minimum | 3 |
|---|---|
| 5-th percentile | 108.85 |
| Q1 | 272 |
| median | 416 |
| Q3 | 589 |
| 95-th percentile | 937 |
| Maximum | 2703 |
| Range | 2700 |
| Interquartile range (IQR) | 317 |
Descriptive statistics
| Standard deviation | 268.47946 |
|---|---|
| Coefficient of variation (CV) | 0.58753632 |
| Kurtosis | 7.1691831 |
| Mean | 456.95807 |
| Median Absolute Deviation (MAD) | 155 |
| Skewness | 1.6497118 |
| Sum | 1013533 |
| Variance | 72081.223 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 350 | 9 | 0.3% |
| 315 | 9 | 0.3% |
| 500 | 9 | 0.3% |
| 467 | 9 | 0.3% |
| 420 | 8 | 0.3% |
| 336 | 8 | 0.3% |
| 443 | 8 | 0.3% |
| 354 | 8 | 0.3% |
| 414 | 8 | 0.3% |
| 564 | 8 | 0.3% |
| Other values (844) | 2134 | |
| (Missing) | 533 | 19.4% |
| Value | Count | Frequency (%) |
| 3 | 2 | |
| 6 | 1 | |
| 8 | 1 | |
| 10 | 1 | |
| 15 | 1 | |
| 16 | 1 | |
| 20 | 1 | |
| 21 | 1 | |
| 28 | 1 | |
| 29 | 1 |
| Value | Count | Frequency (%) |
| 2703 | 1 | |
| 2609 | 2 | |
| 1996 | 1 | |
| 1781 | 1 | |
| 1725 | 1 | |
| 1577 | 1 | |
| 1568 | 1 | |
| 1564 | 1 | |
| 1549 | 1 | |
| 1508 | 1 |
HIV viral load (copies/mL)
Real number (ℝ)
Missing Zeros
HIV RNA copies per mL
| Distinct | 45 |
|---|---|
| Distinct (%) | 15.5% |
| Missing | 2461 |
| Missing (%) | 89.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 20363.586 |
| Minimum | 0 |
|---|---|
| Maximum | 2670000 |
| Zeros | 246 |
| Zeros (%) | 8.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 43.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 7860.2 |
| Maximum | 2670000 |
| Range | 2670000 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 196029.65 |
|---|---|
| Coefficient of variation (CV) | 9.6264796 |
| Kurtosis | 145.0072 |
| Mean | 20363.586 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 11.783887 |
| Sum | 5905440 |
| Variance | 3.8427622 × 10¹⁰ |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 246 | 8.9% |
| 8555 | 1 | < 0.1% |
| 378 | 1 | < 0.1% |
| 2435 | 1 | < 0.1% |
| 6442 | 1 | < 0.1% |
| 13795 | 1 | < 0.1% |
| 200 | 1 | < 0.1% |
| 31 | 1 | < 0.1% |
| 1898105 | 1 | < 0.1% |
| 132 | 1 | < 0.1% |
| Other values (35) | 35 | 1.3% |
| (Missing) | 2461 |
| Value | Count | Frequency (%) |
| 0 | 246 | |
| 10 | 1 | < 0.1% |
| 31 | 1 | < 0.1% |
| 51 | 1 | < 0.1% |
| 74 | 1 | < 0.1% |
| 82 | 1 | < 0.1% |
| 87 | 1 | < 0.1% |
| 132 | 1 | < 0.1% |
| 143 | 1 | < 0.1% |
| 174 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 2670000 | 1 | |
| 1898105 | 1 | |
| 650442 | 1 | |
| 164351 | 1 | |
| 149247 | 1 | |
| 125054 | 1 | |
| 44011 | 1 | |
| 38500 | 1 | |
| 34868 | 1 | |
| 22276 | 1 |
cd4_correction_applied
Categorical
High correlation Imbalance
Quality flag: CD4 corrections applied
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 161.2 KiB |
| 0.0 | |
|---|---|
| 1.0 | 55 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 2696 | |
| 1.0 | 55 | 2.0% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0.0 | 2696 | |
| 1.0 | 55 | 2.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 5447 | |
| . | 2751 | |
| 1 | 55 | 0.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 5502 | |
| Other Punctuation | 2751 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 5447 | |
| 1 | 55 | 1.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 2751 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 8253 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 5447 | |
| . | 2751 | |
| 1 | 55 | 0.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8253 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 5447 | |
| . | 2751 | |
| 1 | 55 | 0.7% |
final_comprehensive_fix_applied
Categorical
Constant
Quality flag: Comprehensive corrections applied
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 161.2 KiB |
| 1.0 |
|---|
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1.0 |
|---|---|
| 2nd row | 1.0 |
| 3rd row | 1.0 |
| 4th row | 1.0 |
| 5th row | 1.0 |
Common Values
| Value | Count | Frequency (%) |
| 1.0 | 2751 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1.0 | 2751 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 2751 | |
| . | 2751 | |
| 0 | 2751 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 5502 | |
| Other Punctuation | 2751 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 2751 | |
| 0 | 2751 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 2751 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 8253 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 2751 | |
| . | 2751 | |
| 0 | 2751 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8253 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 2751 | |
| . | 2751 | |
| 0 | 2751 |
waist_circ_unit_correction_applied
Boolean
Constant
Quality flag: Waist circumference unit corrected
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 24.2 KiB |
| False |
|---|
| Value | Count | Frequency (%) |
| False | 2751 |
sa_biomarker_standards
Categorical
Constant
South African biomarker reference standards applied
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 161.2 KiB |
| 1.0 |
|---|
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1.0 |
|---|---|
| 2nd row | 1.0 |
| 3rd row | 1.0 |
| 4th row | 1.0 |
| 5th row | 1.0 |
Common Values
| Value | Count | Frequency (%) |
| 1.0 | 2751 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1.0 | 2751 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 2751 | |
| . | 2751 | |
| 0 | 2751 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 5502 | |
| Other Punctuation | 2751 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 2751 | |
| 0 | 2751 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 2751 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 8253 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 2751 | |
| . | 2751 | |
| 0 | 2751 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8253 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 2751 | |
| . | 2751 | |
| 0 | 2751 |
climate_daily_mean_temp
Real number (ℝ)
High correlation Missing
Daily mean temperature
| Distinct | 11 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 1616 |
| Missing (%) | 58.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 15.451807 |
| Minimum | 9.356 |
|---|---|
| Maximum | 23.589 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 43.0 KiB |
Quantile statistics
| Minimum | 9.356 |
|---|---|
| 5-th percentile | 9.356 |
| Q1 | 13.213 |
| median | 14.195 |
| Q3 | 19.293 |
| 95-th percentile | 23.589 |
| Maximum | 23.589 |
| Range | 14.233 |
| Interquartile range (IQR) | 6.08 |
Descriptive statistics
| Standard deviation | 3.5385321 |
|---|---|
| Coefficient of variation (CV) | 0.22900442 |
| Kurtosis | -0.30036519 |
| Mean | 15.451807 |
| Median Absolute Deviation (MAD) | 0.982 |
| Skewness | 0.47348153 |
| Sum | 17537.801 |
| Variance | 12.521209 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 19.293 | 214 | 7.8% |
| 13.213 | 208 | 7.6% |
| 14.195 | 187 | 6.8% |
| 13.868 | 144 | 5.2% |
| 9.356 | 98 | 3.6% |
| 18.203 | 67 | 2.4% |
| 23.589 | 62 | 2.3% |
| 13.656 | 53 | 1.9% |
| 13.316 | 41 | 1.5% |
| 17.799 | 39 | 1.4% |
| (Missing) | 1616 |
| Value | Count | Frequency (%) |
| 9.356 | 98 | |
| 13.213 | 208 | |
| 13.316 | 41 | 1.5% |
| 13.656 | 53 | 1.9% |
| 13.868 | 144 | |
| 14.195 | 187 | |
| 17.799 | 39 | 1.4% |
| 18.203 | 67 | 2.4% |
| 19.293 | 214 | |
| 20.293 | 22 | 0.8% |
| Value | Count | Frequency (%) |
| 23.589 | 62 | 2.3% |
| 20.293 | 22 | 0.8% |
| 19.293 | 214 | |
| 18.203 | 67 | 2.4% |
| 17.799 | 39 | 1.4% |
| 14.195 | 187 | |
| 13.868 | 144 | |
| 13.656 | 53 | 1.9% |
| 13.316 | 41 | 1.5% |
| 13.213 | 208 |
climate_daily_max_temp
Real number (ℝ)
High correlation Missing
Daily maximum temperature
| Distinct | 11 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 1616 |
| Missing (%) | 58.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 23.182599 |
| Minimum | 17.553 |
|---|---|
| Maximum | 30.083 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 43.0 KiB |
Quantile statistics
| Minimum | 17.553 |
|---|---|
| 5-th percentile | 17.553 |
| Q1 | 21.474 |
| median | 22.413 |
| Q3 | 26.343 |
| 95-th percentile | 30.083 |
| Maximum | 30.083 |
| Range | 12.53 |
| Interquartile range (IQR) | 4.869 |
Descriptive statistics
| Standard deviation | 2.9483779 |
|---|---|
| Coefficient of variation (CV) | 0.12718065 |
| Kurtosis | 0.15361931 |
| Mean | 23.182599 |
| Median Absolute Deviation (MAD) | 1.066 |
| Skewness | 0.324421 |
| Sum | 26312.25 |
| Variance | 8.6929324 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 26.343 | 214 | 7.8% |
| 22.23 | 208 | 7.6% |
| 23.023 | 187 | 6.8% |
| 21.347 | 144 | 5.2% |
| 17.553 | 98 | 3.6% |
| 22.413 | 67 | 2.4% |
| 30.083 | 62 | 2.3% |
| 21.474 | 53 | 1.9% |
| 20.768 | 41 | 1.5% |
| 25.8 | 39 | 1.4% |
| (Missing) | 1616 |
| Value | Count | Frequency (%) |
| 17.553 | 98 | |
| 20.768 | 41 | 1.5% |
| 21.347 | 144 | |
| 21.474 | 53 | 1.9% |
| 22.23 | 208 | |
| 22.413 | 67 | 2.4% |
| 23.023 | 187 | |
| 25.8 | 39 | 1.4% |
| 26.343 | 214 | |
| 26.769 | 22 | 0.8% |
| Value | Count | Frequency (%) |
| 30.083 | 62 | 2.3% |
| 26.769 | 22 | 0.8% |
| 26.343 | 214 | |
| 25.8 | 39 | 1.4% |
| 23.023 | 187 | |
| 22.413 | 67 | 2.4% |
| 22.23 | 208 | |
| 21.474 | 53 | 1.9% |
| 21.347 | 144 | |
| 20.768 | 41 | 1.5% |
climate_daily_min_temp
Real number (ℝ)
High correlation Missing
Daily minimum temperature
| Distinct | 11 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 1616 |
| Missing (%) | 58.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7.5503286 |
| Minimum | 2.343 |
|---|---|
| Maximum | 14.954 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 43.0 KiB |
Quantile statistics
| Minimum | 2.343 |
|---|---|
| 5-th percentile | 2.343 |
| Q1 | 3.763 |
| median | 6.616 |
| Q3 | 11.253 |
| 95-th percentile | 14.954 |
| Maximum | 14.954 |
| Range | 12.611 |
| Interquartile range (IQR) | 7.49 |
Descriptive statistics
| Standard deviation | 4.0456474 |
|---|---|
| Coefficient of variation (CV) | 0.53582401 |
| Kurtosis | -1.0855077 |
| Mean | 7.5503286 |
| Median Absolute Deviation (MAD) | 2.853 |
| Skewness | 0.50562955 |
| Sum | 8569.623 |
| Variance | 16.367263 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 11.253 | 214 | 7.8% |
| 3.763 | 208 | 7.6% |
| 4.56 | 187 | 6.8% |
| 7.436 | 144 | 5.2% |
| 2.343 | 98 | 3.6% |
| 14.79 | 67 | 2.4% |
| 14.954 | 62 | 2.3% |
| 6.034 | 53 | 1.9% |
| 6.616 | 41 | 1.5% |
| 10.493 | 39 | 1.4% |
| (Missing) | 1616 |
| Value | Count | Frequency (%) |
| 2.343 | 98 | |
| 3.763 | 208 | |
| 4.56 | 187 | |
| 6.034 | 53 | 1.9% |
| 6.616 | 41 | 1.5% |
| 7.436 | 144 | |
| 10.493 | 39 | 1.4% |
| 11.253 | 214 | |
| 13.968 | 22 | 0.8% |
| 14.79 | 67 | 2.4% |
| Value | Count | Frequency (%) |
| 14.954 | 62 | 2.3% |
| 14.79 | 67 | 2.4% |
| 13.968 | 22 | 0.8% |
| 11.253 | 214 | |
| 10.493 | 39 | 1.4% |
| 7.436 | 144 | |
| 6.616 | 41 | 1.5% |
| 6.034 | 53 | 1.9% |
| 4.56 | 187 | |
| 3.763 | 208 |
climate_temp_anomaly
Real number (ℝ)
High correlation Missing
Temperature anomaly from baseline
| Distinct | 11 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 1616 |
| Missing (%) | 58.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7.1579163 |
| Minimum | 3.618 |
|---|---|
| Maximum | 10.271 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 43.0 KiB |
Quantile statistics
| Minimum | 3.618 |
|---|---|
| 5-th percentile | 3.618 |
| Q1 | 6.505 |
| median | 7.489 |
| Q3 | 9.042 |
| 95-th percentile | 10.271 |
| Maximum | 10.271 |
| Range | 6.653 |
| Interquartile range (IQR) | 2.537 |
Descriptive statistics
| Standard deviation | 2.2633511 |
|---|---|
| Coefficient of variation (CV) | 0.31620252 |
| Kurtosis | -0.9276673 |
| Mean | 7.1579163 |
| Median Absolute Deviation (MAD) | 1.553 |
| Skewness | -0.39512055 |
| Sum | 8124.235 |
| Variance | 5.1227584 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 7.489 | 214 | 7.8% |
| 3.654 | 208 | 7.6% |
| 7.602 | 187 | 6.8% |
| 10.271 | 144 | 5.2% |
| 6.918 | 98 | 3.6% |
| 3.618 | 67 | 2.4% |
| 9.042 | 62 | 2.3% |
| 9.839 | 53 | 1.9% |
| 7.913 | 41 | 1.5% |
| 10.025 | 39 | 1.4% |
| (Missing) | 1616 |
| Value | Count | Frequency (%) |
| 3.618 | 67 | 2.4% |
| 3.654 | 208 | |
| 6.505 | 22 | 0.8% |
| 6.918 | 98 | |
| 7.489 | 214 | |
| 7.602 | 187 | |
| 7.913 | 41 | 1.5% |
| 9.042 | 62 | 2.3% |
| 9.839 | 53 | 1.9% |
| 10.025 | 39 | 1.4% |
| Value | Count | Frequency (%) |
| 10.271 | 144 | |
| 10.025 | 39 | 1.4% |
| 9.839 | 53 | 1.9% |
| 9.042 | 62 | 2.3% |
| 7.913 | 41 | 1.5% |
| 7.602 | 187 | |
| 7.489 | 214 | |
| 6.918 | 98 | |
| 6.505 | 22 | 0.8% |
| 3.654 | 208 |
climate_heat_day_p90
Categorical
High correlation Imbalance Missing
Heat day indicator (>90th percentile)
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 1616 |
| Missing (%) | 58.7% |
| Memory size | 167.5 KiB |
| 0.0 | |
|---|---|
| 1.0 | 62 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 1073 | |
| 1.0 | 62 | 2.3% |
| (Missing) | 1616 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0.0 | 1073 | |
| 1.0 | 62 | 5.5% |
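
The same count of 62 heat days appears as 2.3% in the Common Values table and as 5.5% in the Common Values (Plot) table. The figures are consistent with the first being computed over all rows and the second over non-missing rows only; the sketch below checks that reading, which is an inference from the numbers and is not stated in the report.

```python
# Counts copied from the climate_heat_day_p90 tables.
n_rows = 2751
n_missing = 1616
count_heat_days = 62

n_valid = n_rows - n_missing                        # rows with a climate value
pct_of_all_rows = 100 * count_heat_days / n_rows    # denominator includes missing rows
pct_of_valid = 100 * count_heat_days / n_valid      # denominator excludes missing rows

print(f"Share of all rows:         {pct_of_all_rows:.1f}%")
print(f"Share of non-missing rows: {pct_of_valid:.1f}%")
```

When comparing frequencies across variables with very different missingness, it helps to note which denominator a given table uses.
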
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 2208 | |
| . | 1135 | |
| 1 | 62 | 1.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2270 | |
| Other Punctuation | 1135 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 2208 | |
| 1 | 62 | 2.7% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1135 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3405 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 2208 | |
| . | 1135 | |
| 1 | 62 | 1.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3405 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 2208 | |
| . | 1135 | |
| 1 | 62 | 1.8% |
climate_heat_day_p95
Categorical
High correlation Imbalance Missing
Heat day indicator (>95th percentile)
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 1616 |
| Missing (%) | 58.7% |
| Memory size | 167.5 KiB |
| 0.0 | |
|---|---|
| 1.0 | 62 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 1073 | |
| 1.0 | 62 | 2.3% |
| (Missing) | 1616 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0.0 | 1073 | |
| 1.0 | 62 | 5.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 2208 | |
| . | 1135 | |
| 1 | 62 | 1.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2270 | |
| Other Punctuation | 1135 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 2208 | |
| 1 | 62 | 2.7% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1135 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3405 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 2208 | |
| . | 1135 | |
| 1 | 62 | 1.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3405 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 2208 | |
| . | 1135 | |
| 1 | 62 | 1.8% |
climate_heat_stress_index
Real number (ℝ)
High correlation Missing
Heat stress index
| Distinct | 11 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 1616 |
| Missing (%) | 58.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 18.312848 |
| Minimum | 13.428 |
|---|---|
| Maximum | 27.393 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 43.0 KiB |
Quantile statistics
| Minimum | 13.428 |
|---|---|
| 5-th percentile | 13.639 |
| Q1 | 14.306 |
| median | 17.923 |
| Q3 | 21.523 |
| 95-th percentile | 27.393 |
| Maximum | 27.393 |
| Range | 13.965 |
| Interquartile range (IQR) | 7.217 |
Descriptive statistics
| Standard deviation | 3.536553 |
|---|---|
| Coefficient of variation (CV) | 0.19311867 |
| Kurtosis | 0.20907555 |
| Mean | 18.312848 |
| Median Absolute Deviation (MAD) | 3.6 |
| Skewness | 0.58250383 |
| Sum | 20785.083 |
| Variance | 12.507207 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 21.523 | 214 | 7.8% |
| 19.275 | 208 | 7.6% |
| 17.347 | 187 | 6.8% |
| 14.306 | 144 | 5.2% |
| 13.639 | 98 | 3.6% |
| 17.923 | 67 | 2.4% |
| 27.393 | 62 | 2.3% |
| 13.428 | 53 | 1.9% |
| 15.721 | 41 | 1.5% |
| 19.958 | 39 | 1.4% |
| (Missing) | 1616 |
| Value | Count | Frequency (%) |
| 13.428 | 53 | 1.9% |
| 13.639 | 98 | |
| 14.306 | 144 | |
| 15.721 | 41 | 1.5% |
| 17.347 | 187 | |
| 17.923 | 67 | 2.4% |
| 19.275 | 208 | |
| 19.958 | 39 | 1.4% |
| 21.523 | 214 | |
| 22.526 | 22 | 0.8% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 27.393 | 62 | 2.3% |
| 22.526 | 22 | 0.8% |
| 21.523 | 214 | 7.8% |
| 19.958 | 39 | 1.4% |
| 19.275 | 208 | 7.6% |
| 17.923 | 67 | 2.4% |
| 17.347 | 187 | 6.8% |
| 15.721 | 41 | 1.5% |
| 14.306 | 144 | 5.2% |
| 13.639 | 98 | 3.6% |
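Because this column has only 11 distinct values, the quantile and descriptive statistics above can be cross-checked directly from the value counts. The following is a minimal sketch using only the Python standard library. The dictionary `value_counts` is transcribed from the frequency tables above; the use of `statistics.quantiles` with `method="inclusive"` (linear interpolation) is an assumption about how the report computes quartiles, not something the report states.

```python
import statistics

# Value -> count, transcribed from the frequency tables above
# (1135 non-missing observations in total).
value_counts = {
    13.428: 53, 13.639: 98, 14.306: 144, 15.721: 41,
    17.347: 187, 17.923: 67, 19.275: 208, 19.958: 39,
    21.523: 214, 22.526: 22, 27.393: 62,
}

# Expand the counts back into a sorted list of individual observations.
values = sorted(v for v, n in value_counts.items() for _ in range(n))

n_obs = len(values)
mean = statistics.fmean(values)
std = statistics.stdev(values)  # sample standard deviation (n - 1 denominator)

# Assumed quartile method: linear interpolation between order statistics.
q1, median, q3 = statistics.quantiles(values, n=4, method="inclusive")
iqr = q3 - q1

print(f"n={n_obs} mean={mean:.3f} std={std:.3f}")
print(f"Q1={q1:.3f} median={median:.3f} Q3={q3:.3f} IQR={iqr:.3f}")
```

If the recomputed figures disagree with the report beyond rounding, the likely causes are a transcription error in the counts or a different quartile convention.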
climate_season
| Distinct | 4 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 1616 |
| Missing (%) | 58.7% |
| Memory size | 170.8 KiB |
Length
| Max length | 6 |
|---|---|
| Median length | 6 |
| Mean length | 6 |
| Min length | 6 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Autumn |
|---|---|
| 2nd row | Spring |
| 3rd row | Winter |
| 4th row | Spring |
| 5th row | Spring |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| Spring | 609 | 22.1% |
| Winter | 295 | 10.7% |
| Summer | 129 | 4.7% |
| Autumn | 102 | 3.7% |
| (Missing) | 1616 | 58.7% |
Common Values (Plot)
| Value | Count | Frequency (%) |
|---|---|---|
| Spring | 609 | 53.7% |
| Winter | 295 | 26.0% |
| Summer | 129 | 11.4% |
| Autumn | 102 | 9.0% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| r | 1033 | 15.2% |
| n | 1006 | 14.8% |
| i | 904 | 13.3% |
| S | 738 | 10.8% |
| p | 609 | 8.9% |
| g | 609 | 8.9% |
| e | 424 | 6.2% |
| t | 397 | 5.8% |
| m | 360 | 5.3% |
| u | 333 | 4.9% |
| Other values (2) | 397 | 5.8% |
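The character counts in this table follow mechanically from the four season labels and their row counts. As a rough cross-check, the sketch below rebuilds them with `collections.Counter`. The dictionary `season_counts` is transcribed from the Common Values table above; the variable names are illustrative only.

```python
from collections import Counter

# Season label -> number of rows, transcribed from the Common Values table.
season_counts = {"Spring": 609, "Winter": 295, "Summer": 129, "Autumn": 102}

# Each row contributes one count per occurrence of a character in its label,
# so "Summer" adds 2 to "m" for every Summer row.
char_counts = Counter()
for season, rows in season_counts.items():
    for ch in season:
        char_counts[ch] += rows

total_chars = sum(char_counts.values())

for ch, count in char_counts.most_common(5):
    print(f"{ch!r}: {count} ({100 * count / total_chars:.1f}%)")
```

The two characters grouped under "Other values (2)" are `W` and `A`, the initials of Winter and Autumn.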
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| Lowercase Letter | 5675 | 83.3% |
| Uppercase Letter | 1135 | 16.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
|---|---|---|
| r | 1033 | 18.2% |
| n | 1006 | 17.7% |
| i | 904 | 15.9% |
| p | 609 | 10.7% |
| g | 609 | 10.7% |
| e | 424 | 7.5% |
| t | 397 | 7.0% |
| m | 360 | 6.3% |
| u | 333 | 5.9% |
Uppercase Letter
| Value | Count | Frequency (%) |
|---|---|---|
| S | 738 | 65.0% |
| W | 295 | 26.0% |
| A | 102 | 9.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| Latin | 6810 | 100.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
|---|---|---|
| r | 1033 | 15.2% |
| n | 1006 | 14.8% |
| i | 904 | 13.3% |
| S | 738 | 10.8% |
| p | 609 | 8.9% |
| g | 609 | 8.9% |
| e | 424 | 6.2% |
| t | 397 | 5.8% |
| m | 360 | 5.3% |
| u | 333 | 4.9% |
| Other values (2) | 397 | 5.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| ASCII | 6810 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
|---|---|---|
| r | 1033 | 15.2% |
| n | 1006 | 14.8% |
| i | 904 | 13.3% |
| S | 738 | 10.8% |
| p | 609 | 8.9% |
| g | 609 | 8.9% |
| e | 424 | 6.2% |
| t | 397 | 5.8% |
| m | 360 | 5.3% |
| u | 333 | 4.9% |
| Other values (2) | 397 | 5.8% |
Interactions
Correlations
| | Age (at enrolment) | CD4 cell count (cells/µL) | HIV viral load (copies/mL) | Sex | cd4_correction_applied | climate_daily_max_temp | climate_daily_mean_temp | climate_daily_min_temp | climate_heat_day_p90 | climate_heat_day_p95 | climate_heat_stress_index | climate_season | climate_temp_anomaly |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Age (at enrolment) | 1.000 | -0.130 | -0.088 | 0.200 | 0.054 | 0.012 | 0.021 | 0.028 | 0.052 | 0.052 | 0.025 | 0.041 | -0.020 |
| CD4 cell count (cells/µL) | -0.130 | 1.000 | 0.031 | 0.168 | 1.000 | 0.038 | 0.042 | 0.018 | 0.000 | 0.000 | 0.008 | 0.000 | 0.033 |
| HIV viral load (copies/mL) | -0.088 | 0.031 | 1.000 | 0.099 | 0.488 | 0.082 | 0.097 | 0.074 | 0.000 | 0.000 | 0.035 | 0.056 | -0.010 |
| Sex | 0.200 | 0.168 | 0.099 | 1.000 | 0.000 | 0.000 | 0.000 | 0.042 | 0.000 | 0.000 | 0.027 | 0.000 | 0.000 |
| cd4_correction_applied | 0.054 | 1.000 | 0.488 | 0.000 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 |
| climate_daily_max_temp | 0.012 | 0.038 | 0.082 | 0.000 | 0.000 | 1.000 | 0.883 | 0.647 | 0.998 | 0.998 | 0.859 | 0.760 | -0.037 |
| climate_daily_mean_temp | 0.021 | 0.042 | 0.097 | 0.000 | 0.000 | 0.883 | 1.000 | 0.900 | 0.998 | 0.998 | 0.672 | 0.738 | 0.220 |
| climate_daily_min_temp | 0.028 | 0.018 | 0.074 | 0.042 | 0.000 | 0.647 | 0.900 | 1.000 | 0.609 | 0.609 | 0.537 | 0.941 | 0.266 |
| climate_heat_day_p90 | 0.052 | 0.000 | 0.000 | 0.000 | 0.000 | 0.998 | 0.998 | 0.609 | 1.000 | 0.991 | 0.997 | 0.670 | 0.998 |
| climate_heat_day_p95 | 0.052 | 0.000 | 0.000 | 0.000 | 0.000 | 0.998 | 0.998 | 0.609 | 0.991 | 1.000 | 0.997 | 0.670 | 0.998 |
| climate_heat_stress_index | 0.025 | 0.008 | 0.035 | 0.027 | 0.000 | 0.859 | 0.672 | 0.537 | 0.997 | 0.997 | 1.000 | 0.933 | -0.295 |
| climate_season | 0.041 | 0.000 | 0.056 | 0.000 | 0.000 | 0.760 | 0.738 | 0.941 | 0.670 | 0.670 | 0.933 | 1.000 | 0.785 |
| climate_temp_anomaly | -0.020 | 0.033 | -0.010 | 0.000 | 0.000 | -0.037 | 0.220 | 0.266 | 0.998 | 0.998 | -0.295 | 0.785 | 1.000 |
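Several of the climate variables in the matrix above are almost perfectly associated with one another. The sketch below lists the pairs whose absolute coefficient exceeds a cut-off. It uses only the climate sub-block, transcribed from the table; the 0.95 threshold is an arbitrary illustration rather than a value taken from the report. Because the matrix mixes numeric and categorical variables, the entries are best read as general association strengths and not necessarily as Pearson coefficients.

```python
from itertools import combinations

# Climate sub-block of the correlation matrix, transcribed from the table above.
names = [
    "climate_daily_max_temp", "climate_daily_mean_temp", "climate_daily_min_temp",
    "climate_heat_day_p90", "climate_heat_day_p95", "climate_heat_stress_index",
    "climate_season", "climate_temp_anomaly",
]
matrix = [
    [1.000, 0.883, 0.647, 0.998, 0.998, 0.859, 0.760, -0.037],
    [0.883, 1.000, 0.900, 0.998, 0.998, 0.672, 0.738, 0.220],
    [0.647, 0.900, 1.000, 0.609, 0.609, 0.537, 0.941, 0.266],
    [0.998, 0.998, 0.609, 1.000, 0.991, 0.997, 0.670, 0.998],
    [0.998, 0.998, 0.609, 0.991, 1.000, 0.997, 0.670, 0.998],
    [0.859, 0.672, 0.537, 0.997, 0.997, 1.000, 0.933, -0.295],
    [0.760, 0.738, 0.941, 0.670, 0.670, 0.933, 1.000, 0.785],
    [-0.037, 0.220, 0.266, 0.998, 0.998, -0.295, 0.785, 1.000],
]

THRESHOLD = 0.95  # illustrative cut-off, not taken from the report

# Walk the upper triangle and keep the strongly associated pairs.
strong_pairs = [
    (names[i], names[j], matrix[i][j])
    for i, j in combinations(range(len(names)), 2)
    if abs(matrix[i][j]) >= THRESHOLD
]

for a, b, r in strong_pairs:
    print(f"{a} ~ {b}: {r:+.3f}")
```

Every pair above the cut-off involves one of the two heat-day flags. Whether to drop or merge such variables before modelling is a decision the report leaves open.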
Missing values
Sample
First rows
| | study_source | Age (at enrolment) | Sex | primary_date | CD4 cell count (cells/µL) | HIV viral load (copies/mL) | cd4_correction_applied | final_comprehensive_fix_applied | waist_circ_unit_correction_applied | sa_biomarker_standards | climate_daily_mean_temp | climate_daily_max_temp | climate_daily_min_temp | climate_temp_anomaly | climate_heat_day_p90 | climate_heat_day_p95 | climate_heat_stress_index | climate_season |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 3377 | JHB_Aurum_009 | 24.0 | Female | 2014-02-15 | 369.0 | 0.0 | 0.0 | 1.0 | False | 1.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 3378 | JHB_Aurum_009 | 38.0 | Female | 2014-04-09 | 701.0 | NaN | 0.0 | 1.0 | False | 1.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 3379 | JHB_Aurum_009 | 21.0 | Male | 2014-08-12 | 654.0 | NaN | 0.0 | 1.0 | False | 1.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 3380 | JHB_Aurum_009 | 29.0 | Male | 2014-04-29 | 350.0 | NaN | 0.0 | 1.0 | False | 1.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 3381 | JHB_Aurum_009 | 35.0 | Female | 2013-04-29 | 324.0 | 0.0 | 0.0 | 1.0 | False | 1.0 | 17.799 | 25.800 | 10.493 | 10.025 | 0.0 | 0.0 | 19.958 | Autumn |
| 3382 | JHB_Aurum_009 | 22.0 | Male | 2014-06-26 | 276.0 | NaN | 0.0 | 1.0 | False | 1.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 3383 | JHB_Aurum_009 | 38.0 | Female | 2013-11-19 | NaN | NaN | 0.0 | 1.0 | False | 1.0 | 19.293 | 26.343 | 11.253 | 7.489 | 0.0 | 0.0 | 21.523 | Spring |
| 3384 | JHB_Aurum_009 | NaN | Male | 2014-09-08 | NaN | NaN | 0.0 | 1.0 | False | 1.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 3385 | JHB_Aurum_009 | 22.0 | Female | 2013-08-24 | 525.0 | NaN | 0.0 | 1.0 | False | 1.0 | 9.356 | 17.553 | 2.343 | 6.918 | 0.0 | 0.0 | 13.639 | Winter |
| 3386 | JHB_Aurum_009 | 42.0 | Male | 2014-03-24 | 287.0 | NaN | 0.0 | 1.0 | False | 1.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
Last rows
| | study_source | Age (at enrolment) | Sex | primary_date | CD4 cell count (cells/µL) | HIV viral load (copies/mL) | cd4_correction_applied | final_comprehensive_fix_applied | waist_circ_unit_correction_applied | sa_biomarker_standards | climate_daily_mean_temp | climate_daily_max_temp | climate_daily_min_temp | climate_temp_anomaly | climate_heat_day_p90 | climate_heat_day_p95 | climate_heat_stress_index | climate_season |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 6118 | JHB_Aurum_009 | 23.0 | Male | 2013-07-17 | 174.0 | NaN | 0.0 | 1.0 | False | 1.0 | 13.868 | 21.347 | 7.436 | 10.271 | 0.0 | 0.0 | 14.306 | Winter |
| 6119 | JHB_Aurum_009 | 36.0 | Male | 2013-06-06 | 110.0 | NaN | 0.0 | 1.0 | False | 1.0 | 13.656 | 21.474 | 6.034 | 9.839 | 0.0 | 0.0 | 13.428 | Winter |
| 6120 | JHB_Aurum_009 | 29.0 | Male | 2014-06-17 | 393.0 | 0.0 | 0.0 | 1.0 | False | 1.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 6121 | JHB_Aurum_009 | 34.0 | Female | 2014-02-03 | 202.0 | NaN | 0.0 | 1.0 | False | 1.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 6122 | JHB_Aurum_009 | 34.0 | Female | 2014-04-29 | 31.0 | NaN | 0.0 | 1.0 | False | 1.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 6123 | JHB_Aurum_009 | 31.0 | Male | 2014-04-23 | 365.0 | NaN | 0.0 | 1.0 | False | 1.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 6124 | JHB_Aurum_009 | 31.0 | Female | 2013-08-27 | 586.0 | NaN | 0.0 | 1.0 | False | 1.0 | 9.356 | 17.553 | 2.343 | 6.918 | 0.0 | 0.0 | 13.639 | Winter |
| 6125 | JHB_Aurum_009 | 65.0 | Male | 2014-08-14 | 409.0 | NaN | 0.0 | 1.0 | False | 1.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 6126 | JHB_Aurum_009 | 28.0 | Male | 2014-08-04 | 455.0 | NaN | 0.0 | 1.0 | False | 1.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 6127 | JHB_Aurum_009 | 23.0 | Male | 2013-11-16 | 300.0 | NaN | 0.0 | 1.0 | False | 1.0 | 19.293 | 26.343 | 11.253 | 7.489 | 0.0 | 0.0 | 21.523 | Spring |
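The report does not document how `climate_season` was derived. The sample rows above are, however, consistent with Southern Hemisphere meteorological seasons keyed on the month of `primary_date`. The sketch below tests that assumption against the sample rows that carry climate data. The mapping `SEASON_BY_MONTH` and the helper `season_from_date` are hypothetical names introduced for illustration, and no Summer row appears in the sample, so that part of the mapping is unverified here.

```python
from datetime import date

# Assumed month -> season mapping (Southern Hemisphere meteorological seasons).
# This is inferred from the sample rows, not a documented rule of the dataset.
SEASON_BY_MONTH = {
    12: "Summer", 1: "Summer", 2: "Summer",
    3: "Autumn", 4: "Autumn", 5: "Autumn",
    6: "Winter", 7: "Winter", 8: "Winter",
    9: "Spring", 10: "Spring", 11: "Spring",
}


def season_from_date(d: date) -> str:
    """Hypothetical helper: return the assumed season label for a date."""
    return SEASON_BY_MONTH[d.month]


# (primary_date, climate_season) pairs copied from the sample rows above
# that have non-missing climate values.
sample_rows = [
    (date(2013, 4, 29), "Autumn"),
    (date(2013, 11, 19), "Spring"),
    (date(2013, 8, 24), "Winter"),
    (date(2013, 7, 17), "Winter"),
    (date(2013, 6, 6), "Winter"),
    (date(2013, 8, 27), "Winter"),
    (date(2013, 11, 16), "Spring"),
]

# Collect any rows where the assumed mapping disagrees with the recorded label.
mismatches = [(d, s) for d, s in sample_rows if season_from_date(d) != s]
print("mismatches:", mismatches)
```

An empty mismatch list supports the assumption for these rows only; the full dataset would need the same check before relying on the mapping.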
Duplicate rows
Most frequently occurring
| | study_source | Age (at enrolment) | Sex | primary_date | CD4 cell count (cells/µL) | HIV viral load (copies/mL) | cd4_correction_applied | final_comprehensive_fix_applied | waist_circ_unit_correction_applied | sa_biomarker_standards | climate_daily_mean_temp | climate_daily_max_temp | climate_daily_min_temp | climate_temp_anomaly | climate_heat_day_p90 | climate_heat_day_p95 | climate_heat_stress_index | climate_season | # duplicates |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | JHB_Aurum_009 | 23.0 | Male | 2013-07-15 | NaN | NaN | 0.0 | 1.0 | False | 1.0 | 13.868 | 21.347 | 7.436 | 10.271 | 0.0 | 0.0 | 14.306 | Winter | 2 |
| 1 | JHB_Aurum_009 | 32.0 | Female | 2014-03-29 | NaN | NaN | 0.0 | 1.0 | False | 1.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 2 |
| 2 | JHB_Aurum_009 | 37.0 | Female | 2014-10-28 | NaN | NaN | 0.0 | 1.0 | False | 1.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 2 |
| 3 | JHB_Aurum_009 | 39.0 | Male | 2014-08-12 | NaN | NaN | 0.0 | 1.0 | False | 1.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 2 |
| 4 | JHB_Aurum_009 | 49.0 | Male | 2014-04-02 | NaN | NaN | 0.0 | 1.0 | False | 1.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 2 |
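Each group in the table above occurs twice, which matches the 5 duplicate rows reported in the overview (one surplus copy per group). The sketch below shows one way to find such groups using the standard library. The `rows` list is a trimmed, illustrative extract with only a few of the 18 columns, not the real dataset, and the variable names are hypothetical.

```python
from collections import Counter

# Illustrative extract: (Age, Sex, primary_date, CD4, climate_season).
# None stands in for missing values: NaN never compares equal to itself,
# so rows holding separate NaN objects would not be counted as identical.
rows = [
    (23.0, "Male", "2013-07-15", None, "Winter"),
    (23.0, "Male", "2013-07-15", None, "Winter"),
    (32.0, "Female", "2014-03-29", None, None),
    (32.0, "Female", "2014-03-29", None, None),
    (24.0, "Female", "2014-02-15", 369.0, None),
]

# Count how often each complete row occurs and keep the repeated ones.
row_counts = Counter(rows)
duplicates = {row: n for row, n in row_counts.items() if n > 1}

# Surplus copies: rows that would be removed by keeping one per group.
extra_copies = sum(n - 1 for n in duplicates.values())

for row, n in duplicates.items():
    print(f"{n}x {row}")
print("rows to drop:", extra_copies)
```

All five duplicate groups in the report have missing CD4 and viral load values, so it may be worth checking whether they are true repeat records or distinct visits that merely lack laboratory results.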